Integrating Attribute-Based Classification for Answer Set Construction
نویسندگان
چکیده
With the exponential growth of information on the WWW, it is becoming increasingly difficult to find and organize relevant documents. Automatic text classification has been considered as a solution to the problem with its focus mostly on the subject or content of text [1]. Recently, researchers have realized that user information needs are not just based on the subject of a document but also on other properties such as patterns of hyperlinks [2] or text genre [3]. Alternatively, document classification can be used as part of an information retrieval system to improve its efficiency rather than as an independent system [4]. This paper rests on the both directions: a new type of text classification and its use for a retrieval system. The main thrust of our paper is based on our experience in developing and applying an attributebased classification technique, which is in and of itself a new attempt in classification research, in the context of an operational “answer set driven” retrieval system. Our answer set driven retrieval system, like AskJeeves [5], attempts to provide high quality answer documents to user queries by maintaining a knowledge base consisting of expected queries and corresponding answer documents. Since it is still considered difficult, if not impossible, to capture semantics and pragmatics of sentences in user queries and documents, such systems require knowledge bases be built manually so that a certain level of quality is guaranteed. Needless to say, this knowledge base construction process is labor-intensive, typically requiring significant and continuous human efforts [6]. In order to reduce the cost of manually constructing and maintaining answer sets, we have devised a new method of automating the answer document selection process by using the notion of attributed-based classification (ABC).
منابع مشابه
A Framework for Optimal Attribute Evaluation and Selection in Hesitant Fuzzy Environment Based on Enhanced Ordered Weighted Entropy Approach for Medical Dataset
Background: In this paper, a generic hesitant fuzzy set (HFS) model for clustering various ECG beats according to weights of attributes is proposed. A comprehensive review of the electrocardiogram signal classification and segmentation methodologies indicates that algorithms which are able to effectively handle the nonstationary and uncertainty of the signals should be used for ECG analysis. Ex...
متن کاملNeural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten
Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...
متن کاملDeveloping a conceptual model based upon the Latin Hypercube Sampling for integrating OHS into project risk evaluation
Abstract Project management in construction industry, in many cases, is imperfect with respect to the integration of Occupational Health and Safety (OHS) risks. This imperfection exhibits itself as complications affecting the riskiness of industrial procedures and is illustrated usually by poor awareness of OHS within project teams. Difficulties on OHS regularly came about in the construction i...
متن کاملDiscovering Attribute Dependence in Databases by Integrating Symbolic Learning and Statistical Techniques
The paper presents a method for integrating two different types of data analysis: symbolic inductive learning and statistical methods. The method concerns the problem of discovering rules characterizing the dependence between a group of dependent attributes and a group of independent groups, e.g., decision attributes and measurable attributes. The method proceeds along two paths. One paths gene...
متن کاملAttribute-based Access Control for Cloud-based Electronic Health Record (EHR) Systems
Electronic health record (EHR) system facilitates integrating patients' medical information and improves service productivity. However, user access to patient data in a privacy-preserving manner is still challenging problem. Many studies concerned with security and privacy in EHR systems. Rezaeibagha and Mu [1] have proposed a hybrid architecture for privacy-preserving accessing patient records...
متن کامل